Visualization
Data Visualization is the second step in a Rapid Process Troubleshooting project. During this step, data is visualized and prepared for modelling. Visual and statistical techniques are used to prepare the data that will be used in the following steps to build models. Thus it is important to use good quality data as well as data that is an accurate representation of the process.
In this step, you can:
-
Set limits for process variables.
-
Decorrelate the dataset using a correlation matrix.
-
Visualize data using:
-
Identify trends and clusters in the dataset with the regional brushing capabilities.
Accessing the Visualization view
You can access the Visualization view from the troubleshooting context bar located to the left of the Discrete & Batch Troubleshooter project view. The Visualization view becomes active when the minimum requirements for the Data Preparation step are met. These include importing a dataset, and loading the data into the Troubleshooter project.
The Visualization view
The Visualization view provides the following information and options:
-
Field
This is the field name as configured during the data import stage. -
Category
Fields are classified into different categories, such as a target field, a process state, a disturbance, or an adjustable field type.
-
Histogram icon
Click the histogram icon provided on the list view to view a histogram showing the distribution of the selected field. -
Trend icon
Click the trend icon provided on the list view to view a trend against the specified index across the entire dataset for the selected field. -
Limits
Select Enable or Disable to enable or disable the limits set for each field. Setting this option to Enabled will set the qualities of all tag values that exceed either the High high or the Low low limits to bad. All bad quality data points will not be used during model construction and only good quality data (data that falls within the High high and Low low limits) will be used for modelling purposes. -
Low Low
This is the minimum value contained in the loaded data set for the given field. -
Low
The Low limit is a secondary limit that indicates the "operational" lower limit for a given field. -
High
The High limit is a secondary limit that indicates the "operational" higher limit for a given field. -
High High
This is the maximum value contained in the loaded data set for the given field -
Unique
This is the number of unique values contained in the loaded data set for the given field.
For any field containing more than 1000 unique values across the entire data set, the view simply displays ">1000" for the given field.
Visualization Panel
The following options are available as buttons below the field list on the Preparation View:
-
Multiple Histogram
Allows for opening multiple histograms from a field selection dialog. -
Multiple Trends
Allows for opening multiple trends from a field selection dialog. -
Field Correlation
Opens the Correlation matrix for decorrelation of the data set. -
Remove Models
Quickly remove all models from your project.
NOTE: Limits cannot be set if a model has been constructed. Click the Remove Models button to remove all models from the project.
Related Topics: